Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats

نویسندگان

  • Ali Seman
  • Azizian Mohd Sapawi
  • Mohd Zaki Salleh
چکیده

Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

VNTR9 and VNTR10, two newly-found variable-number tandem repeat loci useful in MLVA genotyping of Bordetella pertussis

Background & Aims: Bordetella pertussis, the causative agent of whooping cough, continues to infect human hosts even in those populations where infants and children are routinely vaccinated. Causes of pertussis epidemiology are not fully identified unless strains of the pathogen are characterized by molecular means. Golbally, Multi Locus Variable Number of Tandem Repeats analysis (MLVA) has pro...

متن کامل

[Kinship determination using DNA markers].

BACKGROUND Autosomal and Y chromosome short tandem repeats (STRs) and mitochondrial DNA polymorphisms are the most commonly used molecular tools for determination of kinship. AIM To report a revision of 1,120 kinship cases (paternity and others) analyzed in our laboratory. MATERIAL AND METHODS Revision of all kinship cases analyzed between years 2001-2006. Autosomal and Y chromosome STRs an...

متن کامل

Genetic analysis of two STR loci (VWA and TPOX) in the Iranian province of Khuzestan

Objective(s): Short tandem repeat (STR) loci are the most informative DNA genetic markers for attempting to individualize biological material for application in paternity and forensic cases. Materials and Methods: Blood samples were collected and the total genomic DNA was extracted. The DNA samples were used for genotyping VWA and TPOX STR loci using PCR and polyacrylamide gel electrophoresis. ...

متن کامل

Segmental Duplications as a Complement Strategy to Short Tandem Repeats in the Prenatal Diagnosis of Down Syndrome

Background: Quantitative fluorescence-polymerase chain reaction (QF-PCR) is an inexpensive and accurate method for the prenatal diagnosis of aneuploidies that applies short tandem repeats (STRs) as a chromosome-specific marker. Despite its apparent advantages, QF-PCR is not applicable in all cases due to the presence of uninformative STRs. This study was carried out to investigate the efficienc...

متن کامل

Genome-Wide Development and Use of Microsatellite Markers for Large-Scale Genotyping Applications in Foxtail Millet [Setaria italica (L.)]

The availability of well-validated informative co-dominant microsatellite markers and saturated genetic linkage map has been limited in foxtail millet (Setaria italica L.). In view of this, we conducted a genome-wide analysis and identified 28 342 microsatellite repeat-motifs spanning 405.3 Mb of foxtail millet genome. The trinucleotide repeats (∼48%) was prevalent when compared with dinucleoti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2015